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ABSTRACT 

Telescopes aiming to measure 21cm emission from the Epoch of Reionization must 
toe a careful line, balancing the need for raw sensitivity against the stringent calibration 
requirements for removing bright foregrounds. It is unclear what the optimal design is 
for achieving both of these goals. Via a pedagogical derivation of an interferometer's 
response to the power spectrum of 21cm reionization fluctuations, we show that even un- 
der optimistic scenarios, first-generation arrays will yield low-SNR detections, and that 
different compact array configurations can substantially alter sensitivity. We explore 
the sensitivity gains of array configurations that yield high redundancy in the nv-plane 
- configurations that have been largely ignored since the advent of self-calibration for 
high-dynamic-range imaging. We first introduce a mathematical framework to generate 
optimal minimum-redundancy configurations for imaging. We contrast the sensitivity of 
such configurations with high-redundancy configurations, finding that high-redundancy 
configurations can improve power-spectrum sensitivity by more than an order of mag- 
nitude. We explore how high-redundancy array configurations can be tuned to vari- 
ous angular scales, enabling array sensitivity to be directed away from regions of the 
«f-plane (such as the origin) where foregrounds are brighter and where instrumental 
systematics are more problematic. We demonstrate that a 132-antenna deployment of 
the Precision Array for Probing the Epoch of Reionization (PAPER) observing for 120 
days in a high-redundancy configuration will, under ideal conditions, have the requisite 
sensitivity to detect the power spectrum of the 21cm signal from reionization at a 3a 
level at k < 0.25/i Mpc -1 in a bin of Aln k = 1. We discuss the tradeoffs of low- versus 
high-redundancy configurations. 

Subject headings: cosmology: observations, instrumentation: interferometers, tech- 
niques: interferometric, telescopes 

1. Introduction 

The Epoch of Reionization (EoR) — the rapid ionization of the majority of the hydrogen in 
the universe by light from the first stars and black holes — is the most recent phase transition in 
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the state of the baryons in our universe, and yet it still remains largely unexplored. Observations 
of redshifted emission from the 21cm hyperfine transition of neutral hydrogen have the potential to 



provide unrivaled detail about this epoch (Field 1958; Madau et al. 1997 Furlanetto et al. 2006) 



Variations in this signal versus redshift and direction allow the reconstruction of a three-dimensional 
map of the evolution of the ionization state of the hydrogen. However, reaching the sensitivity to 
image the structures during reionization will require an instrument with roughly a square kilometer 



of collecting area (McQuinn et al. 2006). As a result, first-generation radio telescopes targeting 
reionization either aim to measure the global temperature change of 21cm emission during EoR 
(a task which is not sensitivity-limited), as with the Compact Reionization Experiment (CoRE) 



and the Experiment to Detect the Global EOR Signature (EDGES; Bowman fe Rogers] 2010), or 
instead aim for a statistical detection of the 21cm fluctuations generated by reionization, as with the 

the LOw Frequency ARray (LOFAR; 



Giant Metre- wave Radio Telescope (GMRT; Pen et al. 



2009 



Rottgering et al. 



2006 



the Murchison Widefield Array (MWA; Lonsdale et al. 



2009 



and the 



Donald C. Backer Precision Array for Probing the Epoch of Reionization (PAPER; Parsons et al. 



2010 



A detection of the EoR by the first generation of instruments would establish low-frequency 
radio astronomy as a powerful probe of reionization and of the high-redshift universe. 

Removing foregrounds that are orders of magnitude brighter than the signal and obtaining the 



requisite sensitivity are the primary concerns that influence the design of all 21cm instruments (Par- 
sons et aL]|2010 Bowman et al.|[2009 Paciga et al.||2010 ). Astrophysical foregrounds that interfere 
with the direct detection of a 21cm EoR signature arise primarily from galactic and extragalactic 
synchrotron and free-free emission. With the exception of leakage terms from Faraday rotated 



polarized galactic synchrotron emission (Jelic et al. 2008; Bernardi et al. 2010), all contaminants 



arising from foregrounds are thought to be spectrally smooth or faint enough not to be problematic 



(Petrovic & Oh 2010). The brightness temperatures of these foregrounds can exceed the expected 



10 mK amplitude of the 21cm EoR signal by up to five orders of magnitude (Zahn et al. 2007 



Santos et al. 2005). These foregrounds also serve as a source of noise that dominates the system 
temperature of radiometers in the 100 — 200 MHz band. 

Projects aiming to detect the 21cm EoR signal have taken a variety of different approaches, 
illustrating the breadth of parameter space available for designing such instruments. Much of 
our discussion will focus on the PAPER experiment, but our results generalize to other arrays. 
PAPER employs single-dipole antenna elements that are not steerable — a design that emphasizes 
spatially and spectrally smooth instrumental responses to facilitate calibration and foreground 



removal (Parsons et al. 2010). This approach contrasts with the large-dish approach taken by 



GMRT and the station beam-forming approach taken by LOFAR and the MWA, where dipoles are 
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added in-phase to form more focused beams prior to correlation. PAPER'S design choice favors 
elements with a single primary lobe of response horizon-to-horizon — a choice which directly limits 
the collecting area of each element. On the one hand, for a fixed total collecting area, small 
antenna elements (such as PAPER's) increase the cost of correlation and imaging, owing to the 
0(N 2 ) scaling of the number of baselines with number of antennas N. On the other hand, it is 
imperative that imperfect calibration and sidelobes associated with station beam-forming (used 
by LOFAR and MWA) not introduce spectral structure that impedes foreground removal. The 
trade-off of per-station collecting area versus primary-beam smoothness is one of the major design 
parameters that must be addressed by first-generation experiments to lead into a next-generation 
instrument such as phase-II of the Hydrogen Epoch of Reionization Arra>|^] (HERA; Comm. for a 



Decadal Survey of AfeA; NRC|[2010| and the Square Kilometer Arrajj^] (SKA). In all cases, first- 
generation experiments will be starved for sensitivity, motivating the exploration of techniques for 
improving sensitivity. 

Optimizing these design parameters for future 21cm EoR arrays requires a careful assessment 
of the trade-offs between sensitivity and facility of calibration in first-generation experiments. To 
further this investigation, we provide in ^2] a detailed pedagogical derivation of the sensitivity for an 
interferometer targeting the 21cm reionization signal. Within this section, we relate sensitivity to 
the number of repeated measurements of modes in the uv-plane, motivating sampling redundancy 
as an important metric of the sensitivity performance of an array. In £j3j we explore the conflicting 
goals of arrays aiming to characterize foregrounds and those aiming to measure the evolving power 
spectrum of reionization fluctuations. We then show how antenna placement can be used to tune 
sensitivity relative to bright foregrounds. We evaluate several antenna configurations to arrive at a 
class of configurations that maximize sensitivity to the 21cm EoR signal. Finally, in Q we discuss 
how our configuration studies, along with experience with the technical challenges of foreground 
removal and correlating many antennas, will influence the design of future experiments targeting 
the 21cm EoR signal. 



2. Sensitivity to the 21cm Power Spectrum 



Although derivations of the sensitivity of a radio interferometer to the expected 21cm EoR 
signal exist in the literature ( Morales|2005 McQuinn et al.|2006 Pen et aL||2009 ) , we aim to clarify 
the derivation and be more precise about the approximations that have been made implicitly in 
previous derivations. The goals of this section are to develop a framework for comparing sensitivity 
to foregrounds that are often related in Jy units, to highlight the effects of wide fields-of-view 
and wide bandwidths on the approximations that are made, and to be as clear as possible about 
Fourier transform normalizations while deriving the sensitivity of an interferometric baseline to the 
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three-dimensional power spectrum of 21cm reionization. 



2.1. Single-Baseline Response 

We begin with the basic definitions for the 3D power spectrum of brightness temperature 
fluctuations (the statistic which 21cm efforts aim to measure) and for the visibility (the fundamental 
observable of an interferometer) . We then calculate the response of a single interferometric baseline 
to the 21cm brightness-temperature fluctuations arising from cosmic reionization, thereby deriving 
the relationship between visibilities calibrated to a Jy scale and reionization fluctuations in fc-space 
expressed in mK 2 units. For this derivation, we adopt a Fourier transform normalization convention 
that is consistent with that used in theoretical models (and is standard in cosmological work), but 
which differs from that used in radio astronomy. With respect to the brightness temperature in a 
pixel of the sky plane/frequency data cube, T(x), and its Fourier dual T(k), this convention yields: 



f(k) = i J T(x) e' it3 d 3 x 



Here, V refers to the volume of the observed data cube and a; is a 3D vector that indicates direction 
on the sky and depth (the frequency dimension) within the field. Likewise, k is a 3D wave-vector 
with projection k± = (k x ,k y ) in the plane of the sky, and k z along the line-of-sight (frequency) 
direction. We derive our response in the flat-sky approximation (as discussed below) so that we 
may take x to be cartesian. 

It follows from this convention that an estimate of the power spectrum is given by 

P(k) = (\f(k)\ 2 ) = I m e~^d 3 r, (2) 



where angle brackets denote an ensemble average, r is the vector distance between two points, and 
£(f) is an estimate of the two-point correlation function of the measured T, given by 

£(r) = ^ f T(x)T(x + r)d 3 x. (3) 



It is important to note that the Fourier transform normalization defined by equation [2] is not 



consistent with the transformation defining the visibility for a single baseline (Morales & Hewitt 



2004); a volume factor, V, divides the integral in equation [2j but no observing volume appears in 



the denominator of the following definition of the visibility adapted from Thompson (1999), their 
equation (2-21): 

V(u,v,w,u)= f = ^ m =A(l,m,v)I(l,m,v) 
J v 1 — i — Tn 

^ g— 2iri(ul+vm+w[\/ 1— P— m 2 — 1]) ^\ 
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where v is spectral frequency, I — sin 6 X and tn = sin By are (in the small-angle approximation) 
angular coordinates in image domain, (u, v,w) = b/X are the east-west, north-south, and line- 
of-sight projections of baseline vector b toward a phase center, in units of observing wavelength 
A, A(l, m, v) is a windowing function describing the field-of-view and bandpass response of an 
interferometer pair, and 1(1, m, v) is the specific intensity. 

It is common to neglect the (I, m)-dependence of the w-term in the exponential — a simplifica- 
tion commonly referred to as the flat-sky approximation ( Clark|l9 99). This approximation is valid 



within a radius ~ 10° of the phase center or when phasing to a direction orthogonal to the baseline 
vector (so that w ~ 0). For many low- frequency arrays, including PAPER, wide fields-of-view make 



this approximation invalid, and proper imaging requires techniques such as W-projection (Cornwell 



et al. 



2003). However, the magnitude of the 21cm EoR power spectrum P2\{k) is not expected to 
evolve significantly over fe± |/cj_| for baselines shorter than 300 meters. Hence, the linear combina- 
tion of k- modes generated by the point-spread function (PSF) of a baseline^] is still representative of 
the statistical distribution described by P2i{k). We adopt the flat-sky approximation for simplicity, 
but use the full area of the primary beam to estimate sensitivity. 

To relate Fourier transform conventions that differ by a factor of integration volume, we extend 
the definition of the visibility in the flat-sky approximation to include a similar Fourier transform 
along the frequency axis: 



V(u,v,r/) ~ J dl dm dv A(l,m,u)I(l,m,u) 

x e -2iri(u l+v m+ri u) /g\ 

This definition ignores the frequency-dependence of (u, v) arising from the changing length of 
A dividing the physical separation of two antennas. 21cm EoR experiments have large relative 
bandwidths, with (it, v) varying by as much as 4% over a 6-MHz bandwidth at 150 MHz. The 
4% variation in the k± component of k that arises from the frequency-dependence of (u, v) is 



smaller than the averaging interval used later in £2.3, and approximation does not substantially 
affect sampling of Pzxik), nor does it change the sensitivities we derive. Of greater concern in our 
examination of the frequency-dependent sampling of the itu-plane by a single baseline is the effect 
it may have on foreground removal. We will revisit this issue briefly in <33l but we defer a detailed 



7 From the perspective of deriving the power-spectrum response of a single baseline, the dominant effect of violating 
the fiat-sky approximation is that the fringe-pattern of the baseline (which is a sinusoid in I, m) gradually de-tunes 
from a cartesian sinusoid away from phase center. As a result, a Fourier mode at (u, v) that is sampled by a baseline 
will have a PSF in fc-space that is peaked in k±, but which includes contributions from modes with smaller |fcx| that 
project onto the fringe pattern nearer to the horizon. The degree of peaked-ness depends on the relative gain of the 
primary beam within the region where the flat-sky approximation is valid. For nearly all of the fc-mo des a cce ssib le to 
21cm EoR instruments, \k\ is dominated by the line-of-sight component k z 



17 



and 



18 



that 



It follows from equations 

for line-of-sight scales arising from a 6-MHz bandwidth, the k± compo nent arising from a 300-m baseline perturbs 
|fc| by Alnfc < 0.5, falling within the fiducial averaging interval used in S 2.3 This perturbation decreases rapidly for 
shorter baseline lengths. 
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treatment of the subject to a future paper. 

Squaring both sides and using / = 2k-QT/\ 2 , with A being the mean wavelength over the 
sub-band used in the Fourier transform, yields 



V 2 (u,v,r]) 



dl dm du dl' dm du' 



x A(l, m, u)T(l, m, v)A(l , m , v)T(l , m , v ) 



x e 



(l—l')+V (m-m')+r] {v-v')\ 



(6) 



We now make the approximation that A(l, m, v) is a top-hat windowing function. Explicitly inte- 
grating A(l, m, v) determines the width and shape of the convolution kernel in equation 11 Since 
the width of this kernel is thereafter neglected and only enters later to tally the number of inde- 
pendent wave-modes sampled, the top-hat approximation should be considered purely pedagogical. 
Drawing A(l,m,v) into the bounds of the integral yields: 



V 2 (u,v,r]) 



,e,B) 
dl dm dv 

(0,0,0) 



,e,B) 
dl' dm' dv' 

(0,0,0) 



x T{1, m, v)T(l', m', u ') e -^iHi-n+< m -m')+v(u-u')] ^ 



(7) 



where 6 = vO, for primary beam field-of-view fi. Changing variables so that (l r ,m r ,v r ) = (I 
I' , m — m! , v — v'): 

2 

t2, 



V (u,v,ri) 



2k B 
A 2 



■(0,0,0) 

dl r dm r dv r 
\-9-e-B) 
Ae,e,B) 
+ / dl r dm r dv r 

^(0,0,0) 



+l r fi+m r ,B+v r ) 

dl dm dv 

(0,0,0) 

\e,B) 

dl dm dv 

(l r ,m r ,v r ) 



x T(l,m,v)T(l-l r ,m-m r ,v-v r ) e -^i(ui r +vm r+V v r ) _ 
Integrating over (l,m,v) and using equation [3] yields: 



(8) 



X £ 21 (Z n m r , Ur ) e -^(nlr+v mr+V u r ) ^ 



,B) 

dl r dm r dv r 

-e-B) 



(9) 



where B is the observing bandwidth, and where we now use the subscript "21" to make explicit 
that these quantities are derived for 21cm emission from reionization. Using X and Y to repre- 
sent conversion factors from angle and frequency to comoving distance, respectively, we substitute 
(Xl r , Xm r , Yv r ) for (r x ,r y ,r z ) and (Xk x , Xk y ,Yk z ) for 27r(u,v,r]). The factor of 2ir follows from 
the cosmological Fourier convention. This substitution then yields 



V 2 \(u,v,ri) 



2q g f-{xe,xe,YB) _ 

6i(r)e" 
(-xe-xe-YB) 



A 2 J X 2 Y 



ik-r d 3 r 



(10) 
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This equation establishes the relationship between a (u, v, ?])-mode measured by an interferometer 
and a /c-mode. Hereafter we will use "(u, v, r^)-mode" and "/c-mode" interchangeably to refer to a 



coherent region in Fourier space. Because the right-hand side of equation 10 is the Fourier transform 
of £(f) with a top-hat window, in Fourier space it becomes the convolution of the Fourier transform 
of these functions: 





\ 2 QB r 




/ X 2 Yl 



P 2 i(k) * {s\nc(2X0k x ) 



smc(2X6k y ) siTic{2YBk z )) 



(11) 



where '*' signifies convolution in A;. In the more general case this convolving kernel is not a sine 
function, but the Fourier transform of the primary beam A(l,m, v). For primary beam responses 
larger than 30 arcmin, the width of the kernel in /c-space is much smaller than the scales over which 
P2i(k) varies for the fc-modes that are likely not to be dominated by foregrounds (McQuinn et al. 
2006). Thus, we drop the sine kernel from equation 11, giving: 



VLB 



V A 2 J X 2 Y 



P2i(k). 



(12) 



Theoretical studies often express the 21cm signal in a dimensionless manner given by A 2 (k) = 
^P(fc) (using that P^i(k) is expected to be nearly isotropic; McQuinn et al. 
useful to write equation [12] as 



2006), making it 



Vix(u,v,rj) 



I A 2 J X*Y fc3 ^ [k) - 



(13) 



2.2. Single-Baseline Sensitivity Measuring One fc-mode 

The next step towards estimating the sensitivity to the 21cm signal is to calculate the power 
spectrum of the thermal noise of an instrument. Thermal fluctuations produce a white- noise signal 
with root-mean-square (RMS) brightness temperature T^rms; which in practice will be roughly 
equal to the sky temperature for 21cm instruments. The thermal noise contributes a component 
to the RMS amplitude of the visibility Vn equal to: 

Vn = -^-T^ timB UB. (14) 

This equation can be derived from equation [4] assuming a white-spectrum thermal noise for / with 
temperature T^rms- We substitute Vn for V in equation 13 to get the noise contribution^] to the 



When squaring V in equation 



13 



it is important to construct an estimator of A2i(fe) that is not biased by 
the noise power spectrum. This can be accomplished by subtracting off a measured noise power spectrum, or more 
elegantly by constructing cross-products ViVj from pairs of samples i,j that measure the same Fourier mode but 
have independent thermal noise. The sensitivities that are derived here reflect the residual error that remains in an 
unbiased construction of v. 
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dimensionless power, A 2 <j (k), yielding: 



A 2 N (k) « X 2 Y^nBTl IIDS (u,v, V ). 



(15) 



Since there are 2Bt independent measurements of the noise for time t, the value of T^rms noise 
that should enter in equation [15] is not the true temperature at any given time (which is usually 
called the system temperature T sys ), but rather the error in how well T sys can be measured (which 
relates to the error in how well thermal noise can be measured and subtracted off, and is \^2T sys 



for Gaussian random noise) or rms 



T 2 /Bt. With this substitution, 



A «'^ x2Y &"t T i- 



(16) 



where t is the integration time for sampling a particular (u, v, ?7)-mode, and the factor of two in 
the denominator comes from the explicit inclusion of two orthogonal polarizations to measure the 
total unpolarized signal j^] This equation differs from the derivations given in Morales (2005) and 
McQuinn et al. ( 2006[ ) by only this polarization factor. Note how the power-spectrum sensitivity 
toward a particular /c-mode is independent of bandwidth, and that (Furlanetto et al. 1 |2006| ) 



giving us (for Q m = 0.27): 



X « 1.9 



Y « 17 



X 2 Y 



1 + z 
10 

1 + z 
10 



0.2 



Mpc 



arcmm 



0.15 



Mpc 
MHz : 



540 



1 + z\ - 9 h- 3 Mpc 3 



10 J sr-Hz 

Substituting for X 2 Y at z = 8.5 (assuming observations at 150 MHz) in equation 
fiducial PAPER parameters, we have: 



16 



(17) 
(18) 

(19) 

and choosing 



Agr(Jfe) w 2.8 x 10 4 



A: 



-sys 



0.1/i Mpc" 1 
2 r 120 days 



n 



500 K 



ays 



0.76 sr 

u\ 



20 



(20) 



where we assume 120 days of observation with a baseline of length \u\ ~ 20 that allows ~ 13 
minutes of integration per day, for a total integration time of 9 x 10 4 seconds per (u, v, ??)-mode. 



9 As defined above, A^{k) indicates the noise left in the map after one tries to subtract the noise power using 
all of the available information. It may be defined equivalently as relating to the signal-to-noise at which the true 
power spectrum, A|i(fc), can be measured in a fc-bin: SNR = A2i(fc)/A§f(fe). This definition assumes that is 
calculated for a real-valued sky, so that baselines sampling positive and negative Fourier components are not counted 
as independent measurements. 
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In general, integration time per mode, per day depends strongly on baseline orientation and the 
latitude at which an array is deployed. We will estimate a minimum integration timescale here 



for arrays at mid-latitudes, and defer an exact, configuration-dependent treatment until {2.4 and 
*|3j We compute the amount of time a baseline samples a (u, v, r/)-mode per day, i pe r_mode) as it is 
limited by the timescale for earth-rotation to move the sampling of a baseline a distance of fi -1 / 2 
in the nu-plane: i pe r_mode ~ l/v^^eM; where u; ffi is the angular speed of the earth's rotation. The 
choice of 20 wavelengths as a fiducial baseline length is arbitrary, but represents an estimate of a 



minimum baseline length that is not dominated by galactic synchrotron emission (see £3.2). 



The cosmological 21cm signal is typically much smaller than the noise in a single baseline, as 



given by equation 20 This assumption is reflected in our derivation by the absence of sample vari- 
ance as a significant contribution to the errors we compute. The globally averaged spin temperature 
of the 21cm transition is (T21) = 28 ([1 + z]/10) 1//2 mK for neutral intergalactic medium, assum- 
ing that the spin temperature of the 21cm transition is much larger than the cosmic microwave 



background (CMB) temperature (which will almost certainly hold at z < 10; e.g. Furlanetto et al. 



2006). For a patchy reionization process, an estimate for the dimensionless power spectrum of 21cm 



fluctuations arising from inhomogeneities in the ionization fraction is given by 



A 2 



(T 2 i) 2 (x H - x 2 H )/ln(k max /k 

min ) > 



(21) 



where xh is the average neutral hydrogen fraction and. & m j n and & max are the wave- vectors between 
which most of the power lies. Consistent with this estimate, when xjj ~ 0.5, simulations of 



reionization find 10 < A|i < 100 mK 2 with a flat spectrum over 0.1 < k < 10 h Mpc 1 (McQuinn 



et al.|2007|[Trac Cen|2007| ). Models with rarer sources tend to produce larger ionized regions and 



more power than those with more abundant sources (McQuinn et al. 2007). Comparing equation 



21 to the sensitivity of a baseline in equation 20 motivates the exploration of methods for bolstering 
the sensitivity of instruments to the 21cm EoR signal. 

Before proceeding, it is worth reiterating the assumptions that went into the previous deriva- 
tion, and to consider any generality that may have been lost: 



1. We assumed we could work in the flat sky limit, which we justified by noting that the 21cm 
EoR power spectrum P2i(k) is not expected to evolve on the scale of the mode-mixing intro- 
duced by this approximation. 

2. We ignored the frequency dependence of the (u, v )-coordinates of a baseline for the same 
reason. For baselines longer than ~ 300m, both of the above assumptions break down and 
cause errors at the several percent level. 



3. 



We pedagogically treated the antenna primary beam as top-hat function, but argued that 
any primary beam with FWHM wider than 30 arcmin creates a sufficiently small convolving 
kernel in equation 11 that its shape may be neglected. 



- 10 - 



4. We have assumed the SNR in any individual A;-mode measurement to be much less than 
unity. Since any additional improvement to sensitivity comes from independent fc-modes 
whose inclusion beats down both thermal fluctuations and sample variance, this assumption 
in effect allows us to ignore sample variance as a significant source of error. 



5. Finally, our value of observing time per mode in equation 20 represents a lower bound; 
its exact value generally depends on baseline orientation and array latitude and must be 
computed explicitly for specific EoR experiment locations and configurations. 



2.3. Combining Independent A;-mode Measurements 

With the sensitivity of one baseline to one /c-mode derived, we now turn our attention to the 
sensitivity boost that comes from combining multiple baselines. In this section, we consider an 
analytically tractable case, where each baseline measures an independent /c-mode. Statistically 
independent /c-modes can be combined to improve sensitivity proportionally to the square root of 

1/2 

the number of samples, N s . We ignore sampling redundancy — the possibility that many baselines 
can measure the same A;- mode — which closely approximates the response of minimum-redundancy 



arrays used for imaging (see £3.1). Although somewhat contrived, this example demonstrates how 
several different sensitivity boosts that arise from system and observing parameters. In { 2.4 we 
will use numerical simulations to calculate the sensitivities of real array configurations, including 
sampling redundancy. 

Several assumptions are used to make this derivation tractable. The final expression derived 



in this section — equation 25 — is not intended to be generally applicable, but rather to illustrate 



the different effects that come into play when combining measurements. The fully general case is 



presented in equation 27, where one must numerically calculate the effects of array configuration. 
Our principal assumption is that our baselines uniformly sample the uv-plane within a radius « m ax- 
As before, we also assume that our baselines are short enough to neglect to contribution of k± 
to k, generally true for baselines under 300m. Finally, using PAPER as a model, we assume an 
array at A5°N/S latitude observing for six sidereal hours per day (t per _day below) during which 
colder patches of the synchrotron sky are overhead. Since we assume no sampling redundancy, it 
is irrelevant whether these six hours of observation are spent tracking a single phase center or are 
broken up into several observations with different pointings. 

Before we discuss the different sources of independent fc-mode samples, we define a fiducial 



measurement which all improvements are relative to. In this section, we use equation 20 the 
sensitivity of one baseline measuring one A:-mode as our benchmark. We will refer to the new noise 
level after combining measurements as A^(fe) and express this value relative to our fiducial value, 

We now will outline the different sources of independent /c-mode samples and present physical 
arguments for their dependencies on various parameters. A full derivation, including the prefactor 
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of the final sensitivity result in equation 25 is presented in Appendix |A} 



1. Multiple line-of-sight samples One source of independent /c-mode samples comes from 
the many line-of-sight Fourier modes measured by a single baseline; since A| 1 (fc) is expected 
to evolve on log-k scales, data may be binned in equally-spaced Aln k intervals, so that 
N s (k) oc k. For example, with 6-MHz observing bandwidth, B, a single baseline will sample 
k ~ O.O6/1 Mpc -1 once, k ~ 0.12/i Mpc -1 twice, etc. This linearly increasing number of 
independent samples versus k produces a SNR oc fc 1 / 2 scaling. The number of samples within 
a bin is also dependent on the bin size, giving rise to a final proportionality after combining 
line-of-sight modes: 



A 2 



(k) oc 



1 


2 


1 


2 


1 


k 




B 




Aln A; 



A 2 



,o(*0 



(22) 



2. Multiple time samples Another source of independent measurements comes from the num- 
ber of time bins available for measuring A| 1 (A;) in a sidereal day. These additional samples 
grow linearly with the daily observation length, £ pe r_day (Accumulating samples over multiple 
days was already accounted for in equation |20|) Therefore, the sensitivity increases as: 



A 2 
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^per_day . 

3. Multiple uv-plane samples A final source of independent samples comes from baselines 
sampling different regions of the uv-plane. The most straightforward way these samples 
affect the sensitivity is through adding more baselines. Each baseline is an independent 
measurement, so sensitivity grows as the square root of number of baselines, or, linearly with 
the number of antennas, N. 

Secondly, we need to add up all the measurements across the ra-plane. Using our assumption 

that ^-samples are uniformly distributed within a circle of radius ii max in the uw-plane, 

we integrate contributions from rings of constant \u\ up to a distance \u\ = u max . This 

integration is simplified by noting that each ring of constant \u\ contributes equally to the 

sensitivity of the final measurement; as \u\ increases, the reduction in coherent integration 

time per (u, v, r/)-mode is offset by the increasing number of baselines sampling within that 

. 1/2 

ring. Integrating a constant sensitivity contribution versus \u\ gives rise to a Um ax term in 
the resulting residual noise estimate for minimum-redundancy arrays. 

There is also a factor of solid beam angle fi -1 / 4 , which is a combination of two factors. First, 
there is a decrease in integration time per (u, v, r/)-mode associated with a broader primary 
beam, which scales as the width of the primary beam, fi -1 / 2 . This term is somewhat offset by 
a second factor: the increased number of independent modes sampled, contributing a factor 
of ri 1 / 4 to sensitivity. The result is: 
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Combining all these different gains from binning the data with the prefactor calculated in 
Appendix [A] yields the final result of this section: 
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2.4. The Sensitivity Benefits of Redundant nu-Sampling 

As mentioned above, sensitivity to A|i(A;) depends on both the sensitivity obtained in indi- 
vidual (u, v, ?7)-mode bins and the number of bins sampled. As discussed in the context of CMB 
analysis, sensitivity is most efficiently improved by integrating coherently on select modes until 
SNR ~ 1 is obtained, whereupon sampling additional modes to beat down sample variance in 



the cosmological signal becomes the most efficient way of improving sensitivity (Park et al. 2003) 



Equation 20 suggests that the PAPER experiment, along with many other first-generation 21cm 



EoR experiments (McQuinn et al. 2006), will be firmly in the SNR < 1 regime for individual 
modes for the near future. As a result, it is natural to explore how sensitivity might be improved 
by choosing antenna configurations that maximize the degree to which nu-bins are sampled by 
multiple baselines. 

In this section, we outline a formalism for computing the sensitivity boost of a generic antenna 
array, expressed in terms of a redundancy metric that tracks sensitivity relative to a fiducial mea- 
surement, which we choose to be a single baseline with a one-second integration. The choice of 
fiducial integration time is arbitrary, but affects the scaling constants in the equations that follow. 

Next, we define a metric for the sampling redundancy generated by an array, 

Y - (26) 

where /o is the sampling redundancy of a single baseline with a one-second integration and rii 
represents the number of one-second samples falling in uv-bin i. The ratio f/fo measures the 
increase in sensitivity for a redundant array over one in which there is no sampling redundancy. We 
motivate the definition of sampling redundancy in Appendix [B] and show that sensitivity increases 
as \fJTh- 

The instantaneous (single- integration) redundancy of an array ranges from /o to Nfo, where 
N is the number of antennas in an array configured for the densest non-overlapping packing of 



antennas possible in two dimensions: a filled circular aperture; we derive this case in Appendix B.3 



However, computing the instantaneous redundancy of an array does not account for any additional 
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redundancy that may be generated through earth-rotation synthesis. A baseline sampling a non- 
redundant uv-bin can, some time later, migrate in the uu-plane to sample a bin already sampled by 
another baseline. Generally, the redundancy generated through earth-rotation synthesis depends 
strongly on antenna configuration. We will rely on numerically computed redundancies for specific 
configurations. 

We include the effect of sampling redundancy by using / defined for one-second integrations 



toward a transiting phase center, calculated from equation 16 Our result in equation |27| (which is 
derived in full in Appendix [B]) , is expressed in terms of fiducial observation and array parameters. 
Unlike equation [25} the effects of array configuration are now captured in the computed value for 
/: 
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The value of ///o varies substantially with N and for different antenna configurations. An array 
without redundant sampling will have ///o = 1. A nominal value of ///o = 3.4x 10 , representative 
of the antenna configurations considered later, yields Aj^(£;) ~ 33mK 2 at k = 0.1/iMpc -1 . As 



described in {2.3, we assume six-hour observations. For PAPER, these observations are phased to 
transit pointings separated by two hours and are accumulated into separate (u, v, r/)-bins for each 
pointing. (Two hours corresponds to the approximate width of the PAPER primary beam, after 
which a new, statistically independent region of sky dominates the data). Since there can be no 
redundancy between samples from different pointings, this has the effect of somewhat reducing /. 
Generally, / accounts for most effects relating to observing strategy. 



3. Antenna Configuration Studies 

Designers of interferometric arrays for sonar, radar, and radio astronomy applications have 
long appreciated the necessity of carefully choosing the physical placement of array elements to 
produce desirable samplings of the uv-pl&ne. One of the most popular criteria — the minimiza- 
tion of image-domain sidelobes arising from incomplete sampling — motivates array designs that 
maximize the number of independent Fourier modes sampled, or equivalently, minimize the redun- 
dancy with which txf-pixels are sampled. Such minimum-redundancy configurations are valuable 
for characterizing point-source foregrounds to the 21cm EoR signal, since each it-u-pixel provides 
unique information for constraining the image-domain distribution of flux density. All sampled 
Fourier modes contribute to each image-domain location, making sensitivity independent of an- 
tenna arrangement within a fixed maximum baseline length for image-domain measurements. This 
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gives rise to the traditional adaptation of the radiometer equation for interferometers (see Wrobel 
& Walker|p99l their equation 9-23): 

T Q 

ZmsQs = \J BtN(N - 1) (28) 
where f2 s , the angular size of a synthesized beam, is implicitly related to maximum baseline length. 

In contrast, the sensitivity of Fourier-domain measurements do depend dramatically on array 
configuration. First-generation experiments will constrain the power-spectrum of 21cm EoR fluc- 
tuations by sampling /c-modes accessed via spectral structure in sampled itu-pixels. As discussed in 



£2.4 sensitivity-limited arrays will do best by redundantly sampling a select number of uv-pixels. 
However, maximum-redundancy array configurations run directly counter to the needs of image- 
domain work, and will look counter-intuitive to those familiar with standard minimum-redundancy 
array configurations. 



In £2.1, we discussed how a single baseline measures A^A;) at a range of /c-scales with the 
approximation that (u, v) are not frequency dependent, and argue that this approximation does not 
dramatically affect response to the 21cm EoR signal. The impact of frequency-dependent sampling 
on foreground response is somewhat more concerning, and has been used to argue for configura- 



tions that produce uniform sampling of the uu-plane (Bowman et al. 2009). Such sampling could 
permit chosen uv-modes to be sampled continuously versus frequency, even if the baseline sam- 
pling them changes. In a future paper, we will explore in detail the effects of frequency-dependent 
uu-sampling, showing that for baselines shorter than ~ 100 wavelengths, all but the smallest k- 
modes are accessible using the inherent frequency-dependent uf-sampling produced by a baseline. 
This forthcoming result contrasts with the view that 21cm EoR arrays must produce uniform uv- 
coverage and motivates the exploration of other maximum-redundancy array configurations. With 
an eye toward using the inherent frequency-dependent sampling of each baseline independently to 
sample A^/c), we largely ignore the frequency dependence of array sampling in the discussion of 
maximum-redundancy arrays; redundant sampling will be redundant at all frequencies. 

Both minimum- and maximum-redundancy configurations have valuable properties for 21cm 
EoR work. Array configurations aiming to incorporate aspects of both must attempt to strike 
a balance between their opposing influences. Where this balance lies depends on the relative 
immediacy of sensitivity and foreground-removal needs. Given our current ignorance of many 
foreground properties, it is most straightforward to consider each type of configuration separately, 
as we will below. 



3.1. Minimum- Redundancy Array Configurations 

Designing a minimum-redundancy antenna configuration reduces to choosing a real-valued 

~ i ~ 1 2 

sampling function A(x, y), with Fourier dual A(u, v), such that J \A\ du dv is minimized. This 
optimization problem is usually discretized by sampling the aperture plane on the scale of the 
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aperture of a single antenna element, and by assuming A(x, y) to be unity-valued at a location 
containing an antenna element and zero- valued elsewhere. We are often interested in dense packings 
of antennas that also minimize the maximum distance of uv-samples from the origin. Compact 
packings of antennas have the desirable property of sampling nearly all Fourier modes for a targeted 
angular resolution. Compact minimum-redundancy configurations can also be trivially scaled to 
larger physical spacings to sample smaller angular scales. 

A fact that may be under-appreciated in the radio astronomy community is that this optimiza- 
tion problem has may parallels with Golomb rulers ( Sidon|1932 Babcock||1953 ), Golomb rectangles 
(Robinson 1985), and Costas arrays (Costas 1984) — mathematical constructions originally moti- 



vated by radar and sonar applications. Investigation of Golomb rulers and Costas arrays are active 
fields of mathematical research with interesting applications ( Golomb Sz Gong|2004 ). In particular, 
the study of Costas arrays (N x N matrices with elements chosen such that no two elements share a 
row or column and such that the displacement vector between each pair of elements is unique) has 
yielded algorithms for generating minimum-redundancy arrays where N is near a prime number 



(Golomb & Taylor 1984). For generating array configurations, directly computing antenna loca- 



tions following construction algorithms for Costas arrays is a vast improvement over the iterative 
optimization approaches presented in the literature ( Keto|[l997 de Villiers||2007 ) . 

Although Costas arrays do not quite capture the full minimum-redundancy optimization prob- 
lem (they omit samplings along the u and v axes and they do not attempt to optimize how compactly 
antennas are placed), they do sample approximately one-quarter of the available uv -plane without 
redundancy. This filling fraction exceeds what has been demonstrated with other approaches in the 



literature. For comparison, we examine the dithered Reuleaux-triangle approach favored by Keto 



(1997) for generating configurations with uniform uu-coverage, scaled to the size of an equivalent 
Costas array (in this case, for A = 24) to remove scale-dependence in the redundancy metric. As 
we show in Figure [TJ this configuration redundantly samples 28 locations with its instantaneous 
zenith-phased uu-coverage. For imaging point sources in the high-SNR limit, this configuration 
loses 6.7% of the information accessed by a roughly equivalent configuration derived from a Costas 
array. 

As an example of a larger-sized minimum-redundancy configuration derived from a Costas 
array, we produce the N = 36 antenna configuration shown in the upper-left panel of Figure [T] 



following the Welch construction (Golomb & Taylor 1984), where A is chosen to be one less than 
the prime p = 37. According to this construction, we choose an integer a = 35 with the property 
that < a < p such that a^mod p = 1 and a*mod p ^ 1 for < i < N. This construction produces 



row and column indices (i, a J mod p) for placing antennas on an Ax A grid. Note that 



z, a 



i+j 



mod p) 



also produces a Costas array for < j < p. Figure [T] illustrates the antenna configuration generated 
from a Costas array with j = 23, chosen so that the resulting configuration could be augmented 
with one more antenna (see the upper- left panel of Figure [TJ without incurring any redundanc} 



10 



For certain Costas arrays, relaxing the restriction that no two elements share a row or column allows one more 
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Fig. 1. — The top- left panel shows a minimum-redundancy antenna configuration based on an 
N = 36 Costas array (see £3.1). The associated instantaneous zenith-phased coverage of the uv- 
plane is shown in the top-right panel. For comparison, the bottom-left panel shows a 24-antenna 
configuration derived from a Reuleaux triangle that was iteratively optimized to generate uniform 
uw-coverage ( Keto|[l~997 ) . This configuration has been scaled for comparison to the size of a 24 x 24 
Costas array to remove the effect of physical scale on the redundancy metric. The i«;-coverage 
of this configuration (bottom-right panel) highlights the 28 redundant samplings with larger dots. 
Contrast this with Costas arrays, which are perfectly non-redundant and have simple construction 
algorithms for numbers of antennas near prime numbers. The "x" in the top-left panel shows where 
a 37th antenna can be introduced without generating redundant sampling if we relax the constraint 
that Costas arrays must have only one antenna per row and column. The antenna layout in the 
top-left panel, including the additional antenna, is scaled to grid spacings of 3.75 meters and 8 
meters in the configurations labeled min37c and min37, respectively, in Figures [2| [3j |4j and[5j 
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For comparison with other antenna configurations, we generate two realizations of this minimum- 
redundancy configuration: one using 3.75-meter spacing between rows and columns (min37c), and 
one using 8- meter spacing that would be of more practical use for imaging foregrounds (min37). 
The performance of these configurations are compared with maximum-redundancy configurations 
in Figures [4] and [5} Costas arrays efficiently generate minimum-redundancy arrays for imaging, but 
these figures demonstrate that the minimal redundancy of these arrays has negative repercussions 
for power-spectrum sensitivity. 



3.2. Maximum Redundancy Arrays 

As shown in §2.4[ sensitivity may be gained by focusing limited collecting area on specific 
modes of the power spectrum. However, the rising contribution of galactic synchrotron emission 
at large angular scales, the dominance of point-source emission at small angular scales, and the 
expectation that low-order line-of-sight (smooth-frequency) components must be used to suppress 
foregrounds suggest that array configuration must be informed by foreground characterization. 
Compact antenna configurations improve sensitivity by increasing sampling redundancy, motivating 
the centrally-condensed configurations explored for interferometers targeting the 21cm EoR signal 



(Bowman et al. 2006 Lidz et al. 2008). Such configurations, however, suffer from a number of 



practical deficiencies for this application: 

1. The low fringe-rates associated with baselines sampling small \u\ modes impede the discrim- 
ination of celestial signals from instrumental systematics (e.g. crosstalk). 

2. Phase and gain self-calibration are compromised by the lack of longer baselines. 

3. Proximity of antenna elements can cause cross-coupling, producing antenna-specific deviations 
in primary beam response. 

4. Foreground emission peaks in brightness at the small |u|-modes that are most heavily sampled 
by centrally-condensed configurations. 

Phase switching can help mitigate (though not eliminate) crosstalk, and incorporating a rel- 
atively small number of antennas at longer spacings can improve phase and gain calibration from 
point sources. Increasing the spacing between densely packed antennas can substantially decrease 
cross-coupling at the expense of the redundancy generated from earth-rotation synthesis. The fact 
that foreground emission peaks at small \u\ is unavoidable. 



antenna to be placed within the N x N matrix of possible locations, generating new M-u-samples without incurring 
sampling redundancy. Such augmentations can be tested for in a Costas array by first computing the uu-sampling 
matrix for a Costas array (done by convolving the antenna placement pattern with itself) and then convolving the 
result with the original antenna placement pattern. A zero value within the original N x N matrix indicates a location 
where an antenna can be added without increasing sampling redundancy. Many Costas arrays may be created to test 
for augment-ability by trying all valid a and j, as defined above. 
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Fig. 2. — Shown above are the north-south (vertical axis) and east-west (horizontal axis) an- 
tenna positions in meters for various fiducial array configurations. With the exception of min37c, 
these arrangements aim to improve power-spectrum sensitivity in the regime where errors are not 
dominated by sample variance in the cosmological signal by redundantly sampling regions in the 
ttu-plane with many baselines. The different configurations explore different design ideas, includ- 
ing how maximum-redundancy arrays may be generated for regions farther from the center of the 
uv- plane. In contrast, min37c (see Figure [I]) is an array configuration tuned to minimize sampling 
redundancy, thereby improving imaging by maximizing the number of independent measurements 



of the u-u-plane (see £3.1). The itu-sampling patterns generated by these configurations are shown 
in Figure [3} 
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Fortunately, for measuring a physical scale at reionization, a 21cm EoR experiment has consid- 
erable flexibility in choosing an angular scale, making it possible to generate antenna configurations 
that reap many of the sensitivity benefits of centrally-condensed configurations but which avoid 
some of the associated deficiencies by directing sensitivity toward higher- |u| modes. Consider, for 
example, a configuration consisting of two clusters of N/2 closely-packed antennas whose centers 
are separated by a distance larger than the diameter of each cluster (see hex 19x2 in Figure [2]). 
Excluding the central region of the uv-p\ane sampled by intra-cluster antenna pairs, we see in 
b igure [|that N 2 /A samples are concentrated in a region near \u\ = 30. In this region, excluding 



earth-rotation synthesis, we compute from equation B13 that f / fo = N/2. Including redundancy 



generated by the earth's rotation over the course of a two-hour observation toward a transiting 



phase center, we compute f / fo = 1.1 x 10 4 following equation 26 Furthermore, by adjusting the 
spacing between antenna clusters, this configuration can be tuned to focus sensitivity to regions of 
the uv-plane where galactic synchrotron and point-source foreground emission are minimized. 

The hex 19x2 design described above and shown in Figure [2] can be improved upon in several 
ways. Firstly, perturbing the shape of antenna clusters can improve overlap resulting from earth- 
rotation synthesis. Long rows of antennas (see Inl9x2 in Figure [2]) do well for this; as the Earth 
rotates, this sampling pattern slides over itself along the longest axis. As a result, the new uv- 
samples generated are largely redundant with regions that have already been previously sampled 
(see Figure [3]) . It should be noted that the latitude at which an array is deployed influences the 
design of maximum-redundancy array configurations. At latitudes near 45°N/S, the performance 
of the row-based configurations we explore is largely independent of the orientation of the rows. For 
arrays near the equator, rows oriented east-west yield better sensitivity because the spacing between 
rows is maintained through earth-rotation synthesis. In these cases, however, the slow fringe-rates of 
the north-south baselines generated may make them more susceptible to instrumental systematics. 
Motivated by the locations of current 21cm EoR arrays, we have restricted ourselves to considering 
only mid-latitudes. 

An additional optimization relates to exploiting the Hermitian-symmetry of the uu-plane for 
a real-valued sky: a baseline vector u also samples the uu-plane at —u. As a result, for sampling 
the TO-plane at the location of the displacement between rows, the interior rows in antenna ar- 
rangements such as In4x9, In5x7, and In6x6 in Figure [2] are used in two pairings — once with 
each adjacent row. For N antennas arranged into R rows, R — 2 rows are used in both positive 
and negative pairings, generating a peak instantaneous redundancy of N(R — 2)/R samples. This 
alone suggests that the number of rows should be maximized for best sensitivity. However, when 
earth-rotation synthesis is considered, the row length N/R also becomes important. By empirically 
comparing redundancy metrics for In4x9, In5x7, and In6x6, and by including other comparisons 
for larger numbers of antennas, we have determined that the highest-redundancy configurations are 
generated by nearly square arrangements, with R ~ yN. 

In tuning the spacing between rows, there are several competing factors that need to be 
considered. The first is the increasing brightness of galactic synchrotron emission at small \u\ 
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Fig. 3. — The ttf-plane samplings shown above were generated for each of the antenna configura- 
tions from Figure [2j assuming a 2-hour observation at 40° N latitude of a zenith-transiting phase 
center at 150 MHz. Sampling is plotted using uv-b'ms that are 1.5 wavelengths on a side, with 
color scale indicating log 10 of the number of one-second samples falling in each bin, ranging from 3 
(white) to 5 (black). Antenna configurations generating redundant itu-sampling patterns increase 
sensitivity to particular Fourier modes used to probe the A| x (fc) power spectrum at the expense of 
sampling multiple modes. Several of the configurations illustrated above direct sensitivity toward 
regions at higher \u\, thereby avoiding the instrumental systematics and brighter foregrounds asso- 
ciated with sampling near the origin of the uv-plane. The redundancy metrics (see {2. 4) computed 
for these sampling patterns are shown in Figure |4| 
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following & Ce (x £ 37 scaling law (Chen 2004). Secondly, the ability to control instrumental 



systematics decreases with low fringe-rates. The third is that sensitivity reduces with increasing 
spacing owing to baselines moving more quickly through the uv-plane as the earth rotates. The 
fourth, which will be discussed in greater detail in a forthcoming publication, is the fact that the 
frequency-dependence of -uu-sampling becomes increasingly problematic for foreground removal at 
longer baseline lengths. Finally, the increasing dominance of point-source foregrounds at higher |it| 
imply that there will be diminishing returns for reducing foregrounds by increasing baseline length. 
Taken together, these factors imply that the spacing between rows should target the shortest spacing 
at which galactic synchrotron emission and/or instrumental systematics do not pose a problem. To 
standardize the clustered antenna configurations we examine, we choose a 20-meter fiducial spacing 
between clusters. 

Optimizing families of maximum-redundancy configuration styles is straight-forward, algo- 
rithmically. Automating a broader exploration of configuration space for maximally-redundant 
configurations is much more difficult, owing to the extremely low entropy of these states in con- 
figuration space. Our experience has been that random processes are highly unlikely to encounter 
these configurations, even when a strong selective potential is applied. In order to gain confidence 
that the manually-generated maximum-redundancy configurations we explore are at least nearly 
optimal, it is useful to compare them to the total redundancy of compact antenna configurations 
(see hex37 in Figures [2]and[4|. Figure [4] illustrates that in the 10 < < 20 region for which it was 
optimized, the In5x7 configuration achieves approximately 50% of the peak redundancy of hex37. 
Hence, we may have confidence that while other configurations might outperform In5x7, they will 
not do so by more than a factor of two. 

Finally, it is worth pointing out that far from being a calibration liability, maximum-redundancy 
arrays may actually be more conducive to calibration than their minimum-redundancy counterparts. 
Redundant samplings of the itf-plane with many antenna pairings produce independent measure- 
ments of the same quantity, facilitating the calibration of per-antenna gain and phase parameters 



(Liu et al. 2010). Especially for configurations involving shorter baselines sensitive to large-scale 
structures on the sky, the fact that many baselines fundamentally measure the same quantity can 
improve calibration by easing the need for an accurate sky model on which to base a self-calibration 
loop. 



3.3. Sensitivity Performance 

Of the different configurations considered for maximum-redundancy arrays (see Figure [2]), the 
optimal for sensitivity choice depends on the degree to which the shortest baselines are subject 
to instrumental and celestial interference. To parametrize our ignorance of what are the shortest 
baselines that may be effectively used, we introduce a parameter u m i n to describe a minimum cutoff 
for baselines contributing to power-spectrum sensitivity. For the antenna configurations shown in 
Figure [2j with corresponding uv-coverage in Figure [3| we compute the redundancy metric as a 
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function of tt mm , omitting regions inside of |u| < u m [ n from the numerator in equation 26 The 
results are shown in Figure |4j 

If all baselines may be used effectively to measure A| 1 (fc), the most compact configuration 
(hex32 in Figure [2]) maximizes the redundancy metric. If baselines shorter than five wavelengths 
are unusable, however, the array configurations that are most effective employ rows of antennas that 
have high instantaneous sampling redundancy and also generate substantial redundancy through 
earth-rotation synthesis. In particular, we show in the bottom panel of Figure [4] that the In5x7 
configuration dominates all other configurations within the 5 < u m i n < 20 region for which its row 
spacing was tuned. Based on the success of this design, we extrapolate to a 132-antenna design 
consisting of 11 rows of 12 elements each, with a 3.75-meter spacing between antennas within a 
row. This configuration (labeled lnllxl2 in Figure [5]) was shown to have a higher redundancy 
metric than 8 x 16 and 16 x 8 designs. 

Using the redundancy metric read from Figure [4] at a chosen it mm , we can calculate a sensitivity 
as a function of k by applying equation 27 In Figure [5j we plot the sensitivities for selected config- 



urations using observation parameters matched to the PAPER experiment operating at 150 MHz 
with a 6-MHz observing bandwidth, assuming 120-day drift-scan observations over six sidereal hours 
of sky using phase centers spaced two hours apart. We compare these sensitivities with a toy reion- 



ization model derived from equation 21 under optimistic assumptions that produce 100 mK 2 peak 
fluctuations in the range 0.1 < k < lOh Mpc -1 . As shown in Figure [HJ we find that a 132-antenna 
deployment of PAPER antennas will, under ideal conditions, have the requisite sensitivity to detect 
peak 21cm EoR fluctuations at a 3a level at k < 0.25/i Mpc -1 with approximately four months 
of observation. This result is contingent upon antennas being deployed in a maximum-redundancy 
configurations, and does not include the potential effects of foreground removal on sensitivity, which 
will be discussed in a future paper. As shown by the sensitivity curves for minimum-redundancy 
configurations (min37 and min37c in Figure[5]), maximum-redundancy configurations yield nearly an 
order-of- magnitude improvement in sensitivity for 37- antenna arrays. This improvement is larger 
for bigger arrays, owing to the fact that f / fo oc N 2 for maximum-redundancy arrays, whereas 
f / fo = 1 for minimum-redundancy arrays (except when scaling configurations below the critical 
size where adjacent samples of the uv -plane are no longer independent; in this case, redundancy 
scales with maximum baseline length as f j fo oc u~ 4 1 



"max; 



Although these sensitivity figures were computed using observing parameters for the PAPER 
experiment, it is straightforward to extrapolate them to other experiments. In particular, equation 



27 makes very few assumptions about observing strategies that may differ between experiments — 
all such differences are grouped into the numerically computed redundancy factor. PAPER employs 
drift-scan observations that limit observing time toward one phase center to approximately two 
hours. For experiments that track the sky with dishes or with station beam-forming, the amount of 
time spent observing toward the same phase center may be considerably longer. As a result, there 
may be additional redundancy generated by using earth-rotation synthesis over longer periods. In 
all of these cases, computing the sampling generated for a single phase center yields the correct 
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Fig. 4. — Plotted above is the redundancy metric f / fo (relating to sensitivity, see Eq. 26) for 
several of the antenna configurations in Figure [2] as a function of the minimum distance u m j n that 
may be used for power-spectrum analysis, owing to bright foreground emission and instrumental 
systematics associated with low fringe rates (see £3.3). The most centrally condensed configuration 
(hex37) maximizes / if regions within « m i n are not omitted. If a region with radius u m [ Q > 5 
wavelengths is omitted, configurations with larger separations are preferable, most notably In5x7, 
which dominates all other configurations out to twice the separation between antenna rows. For 
comparison, two minimum-redundancy configurations (min37 and min37c, see Figure [TJ are also 
plotted. The lower plot highlights how the number of rows in a configuration affects redundancy. 
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Fig. 5. — This plot shows ideal, la noise-sensitivity levels to the 21cm EoR power spectrum 
using various array configurations at 150 MHz, assuming PAPER observing parameters for 120- 
day drift-scan observations over six sidereal hours of sky with a 6-MHz observing bandwidth (see 



equation 27). Except for hex37 (where ti m i n = is used), the sensitivities shown assume « m i n = 10 



from Figure [4j Minimum-redundancy configurations (min37 and min37c, above) show significantly 
reduced sensitivity relative to the best-performing In5x7 maximum-redundancy configuration. The 
lnllxl2 configuration is an extension of the In5x7 design to 132 antennas, with a 3.75-meter pitch 
between antennas within a row. The thick black line denotes an optimistic toy model for peak 
21cm EoR fluctuations (see equation 21). Predictions for the signal range between this curve and 
a factor of ten smaller. In this plot, k is dominated by the line-of-sight component for the compact 
configurations we consider; at smaller k, sensitivity departs from a power- law as the contribution 
of a baseline's length to k becomes important. 



redundancy value for use in equation 
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A natural question that arises from the efficacy of antenna clustering for improving sensitivity is 
whether, given the density of sampling within a row, it might be desirable to employ larger antenna 
elements (perhaps parabolic cylinders) in lieu of numerous smaller antenna elements. Phrased 
differently, is the 0{N) improvement to SNR that results from implementing a fixed collecting 
area with N antenna elements worth the 0(N 2 ) cost of correlating them? In the limit that the 
correlator is a dominant cost in the construction of an array, using large dishes, beam-forming 
antennas prior to correlation, or even operating separate sub-arrays may all represent attractive 
options for more cheaply improving sensitivity. On the other hand, extrapolating from currently 
deployed systems using Moore's Law applied to computational density suggests correlators might 
not be the dominant cost for forthcoming arrays (Jason Manley, personal communication), in which 
case smaller elements yield the best sensitivity for a fixed collecting area. This case may be even 
stronger, noting that array configurations consisting of parallel rows are particularly conducive to 



0(N 2 ) scaling of the computational cost of correlators with an 0(N logN) scaling based on the 
fast Fourier transform algorithm. 



Reionization experiments aiming to detect the power spectrum of 21cm EoR fluctuations will 
need to achieve a tremendous level of foreground removal. For characterizing these foregrounds, 
minimum-redundancy array configurations are most useful. However, as efforts turn to constrain- 
ing the three-dimensional power spectrum of 21cm EoR fluctuations, the maximum-redundancy 
configurations we have presented provide a substantial increase in sensitivity over their minimum- 
redundancy counterparts in the regime where sensitivity is not limited by sample variance in the 
cosmological signal — a regime that 21cm reionization arrays will find themselves in for the foresee- 
able future. While the most compact antenna arrangements yield the highest-redundancy sampling 
of the uu-plane, the performance of these arrangements may be compromised by increased fore- 
ground brightness at low \u\ and by instrumental systematics associated with low fringe-rates. 

In order to avoid the difficulties of working near the origin of the tif-plane, we show how 
maximum-redundancy array configurations can be tuned to regions of the uu-plane where fore- 
grounds and systematics are likely to be smaller. Of particular interest are a class of configurations 
using parallel rows of antennas that generate substantial instantaneous sampling redundancy, but 
are also aligned to enhance redundancy through earth-rotation synthesis. Using such a configura- 
tion, we demonstrate that under ideal conditions, a 132-antenna deployment of PAPER observing 
for 120 days will have the requisite sensitivity to detect the power spectrum of the brightest ex- 
pected reionization fluctuations at a 3d level at k < 0.25/i Mpc -1 , using bins of Aln k = 1. The 
real-world sensitivity of such an array will be affected by foreground removal requirements and 
actual performance, and could be worse than this. 



correlation via electric-field gridding techniques 




4. 



Conclusion 
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Next-generation 21cm reionization arrays such as phase-II of HERA will have improved sensi- 
tivity. Even so, HERA's choice of array configuration must balance the competing goals of imaging 
bright EoR structures and characterizing the power-spectrum of EoR fluctuations. For current and 
future arrays, this choice must be informed by parameters that are currently poorly constrained: 
the degree to which foregrounds can be modeled, removed, or otherwise differentiated from the 
21cm EoR signal; the angular power spectra of the dominant foregrounds; the nature of instru- 
mental systematics that arise; and the geometry and collecting area of the most effective antenna 
elements. Exploration of these design parameters is underway with phase-I HERA efforts such 
as PAPER and the MWA. PAPER is in a unique position to use the mobility of its antennas to 
explore different configurations for 21cm reionization work. Maximum-redundancy arrays can be 
used to push sensitivity limits for power-spectrum measurements while minimum-redundancy con- 
figurations will help glean more information about foreground properties. Near-term activities can 
explore the results of tuning array sensitivity relative to foreground brightness and examine the 
influence of cross-coupling and crosstalk on power-spectrum measurements. Continued work in this 
area will aim to establish an optimal array design for next-generation 21cm EoR work. 
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A. Sensitivity from Combining Independent /c-modes 



This appendix derives equation 25 in full. We treat each of the three sources of independent 
samples mentioned in {2.3 separately. We begin with equation 20 reproduced here: 

1 3 



AjSf(Jfe) « 2.8 x 10 4 



k 



O.lh Mpc 



-i 



n 



500 K 



120 days 

^days 



0.76 sr 

u\ 



20 



(Al) 



A.l. Combining Modes Along the Line-of-Sight 



Let us bin the line-of-sight modes in logarithmic increments. For a fixed logarithmic bin size 
of Alnfc, the number of modes in each bin grows linearly with k and linearly with the chosen 
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observing bandwidth B. The bandwidth term enters because it sets the total number of k- modes 
measured for a constant frequency channel resolution. We incorporate these scalings into equation 
Al , using that sensitivity will scale as the square- root of the number of independent k- modes 



binned, and counting the modes in a fiducial bin to set the prefactor. For a bin centered around 
k = O.lh Mpc -1 , a bin of width Aln/c has bin edges at O.OGh and 0.165/i Mpc -1 . A bandwidth 
of 6MHz produces a fc-space resolution of 2tt/YB & 0.083/t Mpc -1 , where Y is given by equation 
Therefore, we count approximately 1.27 independent fc-modes, resulting in a \/L27~ 1.13-fold 



18 



increase in sensitivity, or 



A^j(fc) » 2.48 x 10 4 
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(A2) 



A. 2. Combining Time Samples and Modes Across the ra-plane 

To combine modes measured by different baselines throughout the itu-plane and to calculate 
the amount of time a baseline samples a single (u, v, 77) mode, we must assume an array configura- 
tion. As stated in §2.3[ we assume an array configuration that generates uniform, non-overlapping 
coverage in the uv -plane out to a radius tt max - This assumption makes the problem algebraically 
tractable, and is similar to the minimum-redundancy arrays discussed in § |3.1[ We explicitly sum 
measurements from (u,v,rj)-mode bins, or "i«;-pixels" , over the uv-pl&ne. We use the calculated 



the noise in any uv-pixel from equation A~2 and add up all the samples within each ring of constant 



itu-distance \u\. Finally, we sum over all the rings out to u mscx . 

Let us define two additional terms. Let the sampling density of the uf-plane, p, be given by 

N 2 



ITU 



2 



^ 7ru ma,x 



(A3) 



where iVbi is the number of baselines and is the number of antenna elements. This equation 
restates our assumption that the uv-plane is uniformly sampled out to a radius u max . We also 
define iper_mode 5 the time a baseline samples a single uu-pixel relative to a chosen phase center 
before earth rotation moves it into another pixel: 



^per_mode — ^20 
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"20" 







H 



(A4) 



where £20 is the amount of time a 20- wavelength baseline spends in one pixel (used a fiducial scale) , 
fio is a fiducial primary beam size (0.76 sr for PAPER), and \u\ is the baseline length. Note that, 
per the assumptions of the derivation in §2.3[ we neglect here the possibility that multiple baselines 
may sample the same w-pixel under earth- rotation synthesis. Note also that £20 depends on the 
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array latitude; for PAPER it is approximately 13 minutes; this factor has already been absorbed 



into the prefactor in equation 20 



With these terms defined, we first calculate the number of w-pixels within a ring of radius u. 
We choose an arbitrary ring width w, which will drop out of the derivation later. The number of 
baselines that sample within this ring is then: 



iVbi ~ 2ir\u\wp. 



(A5) 



The number of pixels sampled depends on the observing time, i per _day and the time spent in each 

pixel, iperjiiodc 

£per_day 



px,rmg 



bi 



(A6) 



_^per_mode _ 

To calculate the sensitivity of one ring in the w-plane, we average over the sensitivity of each pixel 
within the ring. Each pixel within the ring has equal sensitivity (e.g. equation A2), so this is a 
simple, unweighted average: 



£A 2 
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N,px 



A 2 

N,px 



where A 2 ^ px is given by equation 



A2 



px a/ A px 
Plugging in values calculated above gives: 



(A7) 
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The next step is to combine all the measurements from different rings. In the case presented 

i| 2 term in the denominator of equation 



A8 



here, the noise power in each ring is equal, as the 
cancels the \u\ term in equation A2 We can therefore do another unweighted average to combine 
the rings: 
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-\/A r i n g S sj U max / W 

where the last step uses the fact that the number of rings is the radius of the circle over the width 
of a ring, u r 



i /w. Plugging equations A2 and A3 into this equation yields: 
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We then reach our final result by substituting in equation ~K5 choosing a fiducial observation time 
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of t per _day = 6 hours (recall that £20 ~ 13min), and choosing an array size of N = 32 elements: 
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B. Sensitivity from Combining Redundant Samples of fe-modes 



In this appendix, we first motivate our definition of the redundancy factor presented in equation 



26 in B.l Next, in B.2 we derive equation 27 in full. Finally, we present an analytic calculation of 



the redundancy metric for a filled circular aperture in B.3 



Several times below we will refer to sensitivity relative to a fiducial measurement. For this 
benchmark, we choose a single baseline with a one-second integration, which we call lsec . The 
choice of fiducial integration time is arbitrary, but affects the scaling constants in the equations that 
follow. For baselines with length |u| < 10 4 (essentially all baselines useful for EoR measurements), 
earth-rotation is unimportant on one-second timescales. Therefore, our fiducial measurement is 



equal to equation 16 



^N,lsec(^) 



X 2 Y 



k rr\L 

2^2t sys ' 



(Bl) 



with t = 1 second. 



B.l. Motivation for the Redundancy Metric 

In this section, we outline a formalism for computing the sensitivity of a generic antenna array, 
expressed in terms of a redundancy metric. The sensitivity calculation in §2.3| and Appendix [A] 
assumed a nu-coverage that produces equal sensitivity in any ring of the nu-plane. We were then 
able to perform an unweighted average to get the final sensitivity by summing over rings. More 
generally, the final sensitivity will be a weighted average of the sensitivity of all the uv -pixels: 

X>« A N,i 

= (B2, 

i 

where i is an index labeling an individual uv-pixel and ; is the noise variance of a mode. The 
optimal weights for any pixel are proportional to the inverse variance of noise in that pixel. Since 
repeated measurements of a uf-pixel add coherently in temperature, redundant measurements beat 
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down the noise in temperature-squared linearly with the number of samples. Therefore, we choose 
optimal inverse- variance weights wi = nf , where rii is the number of fiducial samples in that pixel, 



and equation B2 then becomes: 
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We further simplify this equation by noting that the coefficients of a weighted sum of random 
numbers will add in quadrature: 
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We compare this result to the reduction in noise that would occur without any two samples 
being of the same mode (i.e. being uniformly unity): 
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where the index j labels pixels. (We use a different letter here to differentiate these modes from 



those used in equation B4). The number of samples has remained constant between this case and 



the one calculated above, E 1 = E n *> giving us: 
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The relative improvement of redundant sampling over the completely non-redundant case is 
the ratio between these two terms: 
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We use this result to motivate the definition of our metric for the sampling redundancy given in 

(B8) 



equation 26 
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B.2. Derivation of Maximum-Redundancy Sensitivity 

To derive array sensitivity including the effects of sampling redundancy, we begin by evaluating 



equation Bl at z = 9, substituting in equation 19 
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Expressing the result in terms of our fiducial observing and telescope parameters gives: 



AL S ec(*0 ~ 2.6 x 10 9 
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Next, we incorporate several sensitivity contributions calculate d pr eviously: a factor of 1.13 for 



logarithmic binning of line-of-sight modes (derived in Appendix A.l), a factor of (2.16 x 10 4 )z for 
the number of independent 1-second observations in a 6-hour observing window, and a factor of 120 
for the number of days observed. There is also a factor of (-/Vaselines) 5 sensitivity increase, since 
each baseline provides an independent sample at every integration. For our fiducial array of 32 
antennas, this term is ~ V512. The result is an expression for the sensitivity of an array, assuming 
that every integration of every baseline is treated as a sample of an independent fc-mode: 
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Finally, as derived in Appendix |B.l the effects of sampling redundancy may be included by intro- 
ducing a factor of [///o] 5 - Using a fiducial value of f / fo = 10 4 , we have our final result: 
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B.3. Calculating ///o for a Filled Circular Aperture 

Here we explicitly calculate the instantaneous redundancy of an array where antennas are 
arranged to uniformly sample an aperture within a region defined by \r\ < R, with zero sampling 
elsewhere. Using that the convolution of two disks of area ttR 2 is a cone of height h = irR 2 and 
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base radius 2R, we can compute: 
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Using that h = ttR 2 = NA e , where N is the number of antennas and A e is the effective area of a 
single antenna, we have / = NA e /2 = Nfy. In general, the redundancy metric must be calculated 
numerically to account for more complicated array configurations and the effects of earth-rotation 
synthesis. 
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